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ON REPEATED ZERO-SUM GAMES WITH INCOMPLETE 
INFORMATION AND ASYMPTOTICALLY BOUNDED VALUES 

Fedor Sandomirskiy^’^ 


To the memory of Victor Domansky 

Abstract. We consider repeated zero-sum games with incomplete information on the 
side of Player 2 with the total payoff given by the non-normalized sum of stage gains. 

In the classical examples the value Vn of such an A-stage game is of the order of N or 
Vn as a — >• oo. 

Our aim is to find what is causing another type of asymptotic behavior of the value Vn 
observed for the discrete version of the hnancial market model introduced by De Meyer 
and Saley. For this game Domansky and independently De Meyer with Marino found 
that Vat remains bounded as A —>■ oo and converges to the limit value. This game is 
almost-fair, i.e., if Player 1 forgets his private information the value becomes zero. 

We describe a class of almost-fair games having bounded values in terms of an easy- 
checkable property of the auxiliary non-revealing game. We call this property the piece- 
wise property, and it says that there exists an optimal strategy of Player 2 that is 
piecewise-constant as a function of a prior distribution p. Discrete market models have 
the piecewise property. We show that for non-piecewise almost-fair games with an addi¬ 
tional non-degeneracy condition Vn is of the order of a/A. 


1. Introduction 

The theory of repeated games with incomplete information originated in the early sixties 
from reports of Aumann and Maschler to the United States Arms Control and Disarma¬ 
ment Agency (the reports were published as a book: Aumann and Maschler (1995) [1]). 
Their goal was to develop a game-theoretical framework for repeated disarmament nego¬ 
tiations between the USSR and USA. The main feature of that interaction was a strategic 
usage of information in a dynamic framework, that is, when selecting an action now one 
has to care about information revealed by one’s action and its affect on future behavior 
of the opponent. 

It turned out that the simplest zero-sum case with only one side having private infor¬ 
mation is already very nontrivial. For the introduction to the theory of such games we 
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2 REPEATED GAMES WITH INCOMPLETE INFORMATION AND BOUNDED VALUES 

refer to Aumann and Maschler (1995) [1], Zamir (1992) [35], Sorin (2002) [31], and to 
Mertens, Sorin and Zamir (2015) [26] for further reading. 

In this paper we consider the classical setting mentioned above: zero-sum repeated 
games with incomplete information on one side. In these games two players repeatedly 
play the same zero-sum game which is selected by chance before the hrst stage of a 
multistage interaction according to a prior distribution known to both players. The game 
selected is told to Player 1 only, so he knows the game he is playing while his uninformed 
opponent does not know. Multistage setting gives to the uninformed player an opportunity 
to guess what is the actual stage game by observing previous actions of Player 1. In turn, 
informed player has to balance between two sometimes opposite goals: to beneht from 
his private information at a current stage and to avoid fast revelation so as to be able 
to beneht in the future. Complexity of optimal strategies usually prevents their explicit 
description, and forces qualitative and asymptotic analyses to be the main tools to study 
such games. 

One of the main asymptotic problems is to describe the behavior of the value Vn of 
an A^-stage game for large N. This problem has attracted much attention because the 
asymptotic behavior of the value is related to the beneht that the informed player can 
get from his information. 

Throughout the paper we assume that the total payoh equals to the sum of stage payohs 
(i.e., there is neither discounting nor normalization). Then, as it was shown by Aumann 
and Maschler, under some technical assumptions V/v grows linearly up to an error term 
bounded by a constant times \fN (Aumann and Maschler (1995) [1]; Gensbittel (2015) [18] 
and Neyman (2013) [27] for recent extensions). 

The linear component disappears if we assume that all the strategic advantages or 
disadvantages in a repeated game arise from information asymmetry. Formally this means 
that the auxiliary one-stage game where nobody is informed (the so-called non-revealing 
game) is fair for any prior distribution, i.e., has zero value. We call repeated games with 
this property almost-fair. For almost-fair games the value Vn can be interpreted as the 
value of information possessed by Player 1, and it coincides with the error term and thus 
does not exceed \/N by the order of magnitude. The growth is sublinear because Player 1 
loses the information advantage from stage to stage revealing his private information by 
past actions. 

The paper deals with the problem of hnding a relation between the asymptotic behavior 
of the value Vn of an almost-fair repeated game and the strategic properties of a non¬ 
revealing game. Importance of such a relation comes from the fact that the non-reveling 
game is usually easy to solve. Our main results are easily checkable sufficient conditions 
for the two kinds of asymptotic behavior of the value: being asymptotically bounded 
and \/]V-growth. The conditions are nearly opposite of each other, and hence our results 
provide an almost complete characterization of the possible asymptotic behaviors. This 
gives a general explanation of many results from the literature. 

For a long time all known almost-fair games had \/]V-behavior of the value. Almost all 
methods in the theory of repeated games with incomplete information originated as tools 
to analyze games of this class. The hrst example was constructed by Zamir (1971) [34] 
(we include it in Subsection 1.1). Mertens and Zamir (1976, 1995) [23, 25] proved that for 
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a class of games containing Zamir’s example Vn/VN converges to a limit related to the 
normal distribution. They used a very technical analysis of a recurrent equation for the se¬ 
quence of values Vn- De Meyer (1996A, 1996B) [2, 3] introduced a duality approach which 
allowed to extend the results of Mertens and Zamir to a broader class of games without 
dealing with any technicalities. He obtained two representations of the V^/ limit: one 
from the Central Limit Theorem and one as a solution of a partial differential equation. 
Other results were based on hnding explicit solutions (see Heuer (1991) [20], Domansky 
and Kreps (1994, 1995, 1999) [12, 13, 14]). Similar problems of dynamic strategic use of 
information also demonstrate \/]V-behavior of the value. Mertens and Zamir (1977) [24] 
and De Meyer (1998) [4] studied martingale optimization problem of the maximal L}- 
variation of a bounded martingale which is connected to the optimal speed of information 
revelation in games with \/]V-behavior (Gensbittel (2015) citeCavU showed that any game 
can be reduced to a generalized problem of the maximal variation). Problems arising as 
continuous-time limits of \fN-gam.es were discussed by De Meyer (1999) [5] and Gensbit¬ 
tel (2013) [19] (working with such limit problems is known as “compact approach”, see 
Sorin (2002) [31]). 

An example of another asymptotic behavior of Vn is given by a hnancial market model 
introduced by De Meyer and Saley (2003) [6] to analyze impact of information asymmetry 
on a hnancial market. In this model a risky asset is exchanged between two agents; one of 
them is an insider and knows the actual value of a risky asset, and the other does not (see 
Subsection 1.1 for the dehnition). Depending on existence of the minimal currency unit 
the corresponding almost-fair game demonstrates either asymptotically bounded value 
or Vn of the order of \/iV, though the model is robust to change of other modeling 
assumptions: 

• If there is no minimal currency unit De Meyer and Saley showed that the value has 
\/]V-behavior and the limit price-process is related to the Brownian motion. Prices 
are the elements of strategy, and the authors argue that their result provides en¬ 
dogenous justihcation for the Brownian motion in hnance. The informal reason for 
the appearance of the Brownian motion is that the insider can beneht with reveal¬ 
ing an arbitrary small “portion of information” using an arbitrary small change 
of his price. As N ^ oo optimal “portions” become inhnitesimally small, and 
the Brownian motion arises from the Gentral Limit Theorem, and y/N originates 
from the standard normalization. Various extensions of this model (De Meyer and 
Marino (2004) [7], De Meyer (2010), De Meyer and Fournier (2015) [10]) have the 
same properties. 

• Domansky (2007) [11] and independently De Meyer with Marino (2005) [8] showed 
that introducing the minimal currency unit radically changes the behavior of the 
model: the value Vn becomes bounded as V —)■ cxo. The informal explanation is 
that the minimal currency unit does not allow insider to obtain non-zero proht 
revealing a small “portion of information” because he has to change his price 
signihcantly: at least by one currency unit. This forces him to reveal information 
fast and to lose the information advantage in a hnite number of steps collecting 
only a bounded total gain. This informal explanation is supported by results of 
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Domansky [11] who explicitly solved the inhnite-stage version of the game and 
showed that the optimal strategy of the insider reveals his private information 
in hnite number of stages and that the price-process is a simple random walk 
over the lattice of admissible prices with absorption at the true value of the asset. 
Asymptotic boundedness and fast revelation holds for different modihcations of the 
model with minimal currency unit (Domansky and Kreps (2009, 2013, 2016) [15, 
16, 17], Sandomirskaia (2016) [29]). 

The market model with minimal currency unit was historically the second non-trivial 
example of an almost-fair game with bounded values. However, the hrst such example 
from (Domansky and Kreps (1994) [12]) is similar to market model with three admissible 
bids (see Subsection 1.1 for details). 

Our research was inspired by the desire to hnd an abstract property responsible for the 
effect of Vn being bounded in an almost-fair game. Theorem 2.1 shows that the value is 
bounded if in the non-revealing game Player 2 has an optimal strategy that is piecewise- 
constant as a function of the prior distribution. This is what we call the piecewise property. 
The market models with minimal currency unit are piecewise games (Example 2.2) thereby 
Theorem 2.1 immediately implies many results cited above. Theorem 2.2 shows the result 
is sharp: if for some interval of prior distributions the optimal strategy of Player 2 is 
unique and non-constant, then the value is of the order of \fN. Remark 2.5 compares 
this condition with the weakest known sufficient condition from De Meyer (1996A) [2]. 
Up to an additional assumption of uniqueness the theorems give a characterization of 
possible asymptotic behaviors of Vat for almost-fair games. These results are stated in 
Section 2. The proofs explain the strategic origin of the two behaviors: we formalize the 
intuition that relates the behavior of Vn with an opportunity to beneht from an arbitrary 
small revelation of information (see above and Propositions 3.1 and 4.1). The proof of 
Theorem 2.1 is contained in Section 3. The approach is based on ideas of De Meyer and 
Marino [8] that link upper bounds on the value with invariant functions of the recurrent 
equation and on the explicit construction of an invariant function using geometry of 
the Kantorovich metric. Theorem 2.2 is proved in Section 4. For this purpose we pass 
from the game to a martingale optimization problem of generalized maximal variation 
(Gensbittel (2013) [18]). By analyzing optimal strategies under small perturbations of 
matrix games we show that the value of the martingale optimization problem is bounded 
from below by a quantity related to the typical movement of a simple random walk after 
N steps that is of the order of . The last Section 5 presents a list of open problems. 

1.1. Notation, definitions, classic results, and examples. In N-stage zero-sum 
game Pat = Pat (p) with incomplete information on the side of Player 2 the players repeat¬ 
edly play the same / x J matrix game depending on a random state k & K. The state 
k is selected by chance before the hrst stage from the set of states K according to a prior 
distribution p G A(iP) known to players (A(iP) denotes the set of all probability measures 
over K). Player 1 knows the realization of /c, but Player 2 does not. The players choose 
their actions in ^ I and jn G J at a stage n = 1, 2, ..iV taking into account their current 
knowledge: the history of their actions hn = (known to both) and knowledge of 

k (for Player 1). Stage payoffs A^^ -^ are not observed during the game. The objective of 
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Player 1 (2) is to maximize (minimize) the expected total payoff^ Gn = 


E 


2-^n=l 


f'Ti ijn 


Players can randomize their actions using behavioral strategies (dehned below), and so 
the expectation is taken with respect to the joint distribution of k and /iat+i. 

A behavioral strategy a of Player 1 is a collection {an}n=i, where an{hn,k) G A(J) is 
the distribution used by Player 1 to randomize his action depending on his “knowledge” 
{hn, k) at a stage n. In a behavioral strategy r = of Player 2 the distribution 

'Tn{hn) G A(J) used to select does not depend on the state k as Player 2 does not know 
it. Note that = Giy{a,T) represents the game Pat in normal form. By Kuhn’s the¬ 
orem behavioral strategies are equivalent to mixed strategies if deterministic behavioral 
strategies (i.e., such that cr„ and take values in the Dirac ^-measures over I and J, 
respectively) are considered as pure. Hence under the standard hniteness assumption of 
J, J, and K the min-max theorem applies. Therefore the game P n{p) has a value Vv(p) 
and players have optimal strategies. The game also has a value for compact metric spaces 
/, J, and K with continuous A by the standard approximation arguments: optimal strate¬ 
gies in the game with discretized sets of actions and states induce e-optimal strategies in 
the original one, and e goes to zero as discretization becomes hner (see Remark 3.2 for 
an example of using such techniques). 

The non-revealing game P^^(p) is a version of the one-stage game Pi(p) where both 
players do not know the state k. This game is equivalent to I x J matrix game with 
expected payoff matrix A^ = EH^. Hence A^ = hnite K (here pi = P({fc = 

/}) is the weight that p gives to an element I G K). The value of the non-revealing game 
is traditionally denoted by u{p). 

The Cav [wj-theorem of Aumann and Maschler shows that the the non-revealing game 
is responsible for the leading term of repeated game’s value as N ^ oo\ 


(1.1) Vn{p) = N ■CaY[u]{p) + 0{\fN), N —)■ CXO, 


in the case of hnite /, J, and K. Here Cav [m] denotes the least concave majorant of u 
treated as a function A(JP) —)■ M. 

We say that a repeated game Pv is almost-fair if the value u{p) of the non-revealing 
game is zero for any p G A(iP). In other words, almost-fair game is a game that becomes 
fair if Player 1 forgets k. By (1.1) the value of an almost-fair game can not grow faster^ 
than \fN. 

The following almost-fair games will be used to illustrate our results: 


Example 1.1 {The first game with Vn of the order ofy/N). Zamir (1971) [34] showed that 
for the repeated game with J = J = JP = {0,1} and stage games given by 


( 1 . 2 ) 


= 


3 -1 

-3 1 


and A^ = 


2 -2 

-2 2 


^Usually one considers the expected average total payoff, i.e., the expected sum of stage gains divided 
by TV, to ensure that the sequence of values IZv is bounded as TV —>• oo. However, we do not follow this 
convention as Vn remains bounded in the games we are interested in without any normalization. 

^For almost-fair games with infinite I, J, and K the value can also grow as TV“ with a S (0.5,1), see 
Sandomirskiy (2014) [30]. 
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the value grows as \fN. Checking that the non-revealing game -|- piA^ has 

zero value is left to the reader. 

Example 1.2 {The first game with bounded V^). A repeated game is called flat if u{p) 
is an afhne function of p. Domansky and Kreps (1994) [12] explicitly solved all flat 
(“eventually revealing” by their terminology) games with / = J = iC = {0,1}. They 
discovered that except for games similar to (1.2) this class contains almost-fair games 
with bounded values: if stage games are 

(1,3) T=([J and T = ([“' (1-a))' 

then the value Vn{p) does not exceed 1/2 for any N and p. If a = 0, the game T 7 v(p) is 
especially simple: the optimal strategy of Player 1 does not depend on n and prescribes to 
play “top” if fc = 0 and “bottom”, otherwise. This strategy completely reveals k after the 
hrst stage and leads to Vn{p) = max{po,Pi} for any N. For a > 0 the optimal strategy 
becomes n-dependent. Note that the degenerate case of a = 0 is equivalent to the discrete 
market model with m = 2 described in Example 1.3. 

Example 1.3 {The financial market model with asymmetric information). A model of 
De Meyer and Saley (2003) [6] has I = J = [0,1], the set of states K = {0,1}, and 
the stage games given by A^j = sgn {i — j){k — max{h j})- This repeated game can be 
interpreted as follows. The state /c is a liquidation value of a risky asset, and only Player 1, 
the insider, knows k. At each stage of P m{p) both players propose their prices in and 
for the asset, and the player with higher price buys one unit of the asset from his opponent 
for this price. Players have enough assets and money. The objective of both players is 
to maximize their expected welfare after N trading rounds. This game is almost-fair (by 
symmetry) and has \/]V-behavior of the value. We refer to this model as the continuous 
one. 

Introducing the minimal currency unit ^,rn G N, leads to the discrete version of the 
model with I = J = |0, Por the discrete model the value Vn remains bounded 

as N ^ oo, see Domansky (2007) [11] and De Meyer with Marino (2005) [8]. 

2. Results 

The following property turns out to be responsible for the sequence of values being 
bounded as N ^ oo. 

Definition 2.1. We say that a repeated game P n with incomplete information is a piece- 
wise game if there exists a function y* : A{K) —)■ A( J) taking a hnite number of different 
values and such that v*{p) is an optimal strategy of Player 2 in the non-revealing game 

pNR(^) p ^ 

Theorem 2.1. Let Ptv be an almost-fair piecewise repeated game with incomplete infor¬ 
mation and finite /, J, and K. Eor any p G A(iP) and N > 1 

(2.1) 0 < VM < WMiipQ, 

where ||A||iip = maxij^k^k' and Q is the number of different values that y* takes 

(see Definition (2.1)). 
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Remark 2.1. Note that only the upper bound in (2.1) is non-trivial since playing his 
optimal strategy for at all stages of T]\f{p), Player 1 guarantees expected payoff of 

at least 0. 


Corollary 2.1. Under the conditions of Theorem 2.1 the sequence Vn{p) has a finite 
limit Vooip) as N ^ oo. Indeed, this sequence is bounded and non-decreasing (if at the 
first stage of T Player 1 plays the strategy optimal in P^^(p) and then plays as if 
the remaining game is PAr(p), he gets at least Uv(p)). 

Remark 2.2. Theorem 2.1 can be easily extended in two ways: 

• The result holds for flat games (games with affine u{p) = 

put the error term U/v(p) — Nu{p) instead of V^ip) in (2.1). This can be deduced 
from strategic equivalence of any flat game to an almost-fair game with stage 
payoffs given by — val)^^]. 

• The result holds for games with arbitrary compact metric space iP, hnite /, J, and 
continuous A without any changes. Indeed, the right-hand side of (2.1) does not 
explicitly depend on cardinality of K that allows to use standard approximation 
techniques. For the formal argument see Remark 3.2. 


Example 2.1. Let us check that the game from Example 1.2 is a piecewise game. In a 
non-revealing game = PqA^ + PiA^ the optimal strategy of Player 2 is to play “left” if 
Pi > po and play “right” if pi < Po- If pi = po any convex combination of the two actions 
is optimal. So 


y*{p) = 

does the job (here and below da denotes the Dirac ^-measure concentrated at a). 




Pi > Po 
Sr, Pi < Po 


Example 2.2. It is easy to check that the discrete market model (see Example 1.3) has 
the piecewise property for all m. If nobody knows fc, and the probability pi = P({/c = 1}) 
belongs to the interval for some q = 0,1, ..m — 1, then for both players it is 

optimal to select i = j = Hence the pure strategy y*{p) = S [p,m] guarantees 0 in the 

m 

non-revealing game ([x] denotes the integral part of a; G M). Informally, the reason is 
that, if nobody knows the value /c of a risky asset, the optimal price in the continuous 
model would be the expectation Efc = pi, but in the discrete one only discrete prices are 
allowed, and hence players select the closest point to pi. So the convergence results from 
Domansky (2007) [11] and De Meyer with Marino (2005) [8] become immediate corollaries 
of Theorem 2.1. 

For the continuous market model (recall that it has Vjy of the order \/N) the optimal 
strategies in the non-revealing game are i = j = pi- They are unique and depend 
continuously on the prior p. This suggests a form of converse to Theorem 2.1. 


Let [p'^p”] denote the segment {ap' -|- (1 — a)p'' \ a G [0,1]} for p',p'' G A(JF). By 
Ar®iin*(iF) we denote the relative interior of A{K), i.e., the set of all p G A(JF) such that 
Pk > 0 for any k G K. 


Theorem 2.2. Let Fw be an almost-fair repeated game with incomplete information and 
finite I, J, and K. Suppose that there exists a segment [p',p"] such that for all p G [p',p"] 
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the optimal strategy y*{p) of Player 2 in the non-revealing game is unigue, but 

y*{p) takes infinitely many different values when p ranges over [p',p"]. Then for any 
p e 

(2.2) Ciy/N < Vn{p) < C2^/N, N > 1, 

where Ch = Chip), h = l,2, are positive constants independent of N. 

Remark 2.3. The upper bound in Theorem 2.2 follows from the estimate (1.1) of Aumann 
and Maschler. The lower bound shows that the piecewise property is almost a criterion 
of boundedness. We expect that the piecewise property is a criterion, but the role of 
uniqueness assumption in Theorem 2.2 remains a question for future research. 

Example 2.3. Zamir’s game (Example 1.1) fulhlls the assumptions of Theorem 2.2. The 
optimal strategy y*ip) of Player 2 in the non-revealing game poAfi + piAi is unique and 
equals H continuously depends on p and takes inhnitely many values on any 

segment \p',p"] with p' ^ p". Hence Theorem 2.2 implies \/]V-growth of V/v(p). 

Remark 2.4. Theorem 2.2 can not be directly applied to the continuous market model 
(Example 1.3) because hniteness of I and J is important for the proof. 

Remark 2.5. Assumptions of Theorem 2.2 are weaker than the condition for the '/N- 
behavior by De Meyer (1996A) [2], the weakest condition from the literature. Under the 
additional assumption that certain partial-differential equation has regular solutions, he 
claims that the limit of Vn/\/N as N ^ oo exists for an almost-fair game if (1) all the 
sets I,J, and K are hnite (2) both players have equal number of actions, i.e., |/| = \J\ 
(3) in the non-revealing game Player 1 has unique optimal strategy x* independent of 
p G A(A'), and this strategy is completely mixed. Let us check that Theorem 2.2 applies 
to any such game except the degenerate case with Uv(p) = 0 for any N and p. Kaplansky 
lemma (see De Meyer (1996A) [2], Lemma 4.1) implies that the optimal strategy y*ip) of 
the second player is unique and completely mixed and rationally depends on elements of 
A^. Therefore y*ip) either fulhlls the assumptions of Theorem 2.2 or does not depend on 
p at all. In the last case Uv(p) = 0 because Player 2 defends zero by playing y* at any 
stage of Pv(p). 

Let us modify Zamir’s example (Example 1.1) by adding a hxed convex combination of 
rows to both matrices A^ as a new pure strategy of Player 1. This new game is strategically 
equivalent to the initial one and has Viv of the order of '/N that can be deduced also 
from Theorem 2.2, but De Meyer’s result becomes inapplicable because |/| \J\- So 

Theorem 2.2 gives a wider class of \/]V-games, but existence of the limit lim^v^oo Vn/VN 
is an open problem for this class. 

3. Recurrent equation, the Kantorovich metric, and proof of 

Theorem 2.1 

There are two different approaches to investigate repeated games with incomplete in¬ 
formation: from the perspective of the informed player and from the perspective of his 
non-informed opponent. The hrst one leads to a martingale-optimization problems that 
arise in selecting the optimal rate of revealing information. The second one, the dual 
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approach, is based on the concept of dual game introduced by De Meyer and on its re¬ 
current structure. The problem of hnding the optimal strategy of Player 2 has a flavor 
of multi-criteria optimization: such a strategy does not depend on the state k but has to 
be efficient for any k E K. This is made precise in an application of the approachability 
theory studying games with vector payoffs to the behavior of uninformed player in long 
games with incomplete information. 

Our proof of Theorem 2.1 is based on the hrst approach^. The scheme resembles the 
one used by De Meyer and Marino (2005) [8] to derive an upper bound for the value of 
the discrete market model. We start from the recurrent equation Vn+i{p) = T[Vn]{p) 
in the form from Gensbittel (2015) [18] and construct an explicit non-negative invariant 
function h of the Shapley operator T using the Kantorovich metric. Monotonicity ideas 
of De Meyer and Marino imply that h is an upper bound for Vat. This lets us almost 
avoid strategic analysis of the game but nonetheless get an explicit upper bound. 


3.1. Recurrent equation. Given a strategy a of Player 1 the process of information 
revealing by his actions is described by a sequence of posterior distributions G A(iP) 
of /c at a stage n, i.e., are dehned as the conditional probability oi k = I given hn- 
Posterior distribution of k can be treated as a dynamic state variable of the game from the 
Player’s 1 point of view since p^^^ represents the beliefs of Player 2 about /c at a current 
stage. Denote the sequence of random variables p^^\p^‘^\ by pG>i)_ 

The sequence of random variables is called a martingale of length N 

adapted to the natural hltration (hereafter, a martingale) if the conditional expectation 
E I equals for all n = 1, ..N - 1. 


^I am grateful to Eilon Solan for telling me about the paper of Manner and Perchet (2013) [22] 
which allows to use the second approach to prove boundedness of Vn- They studied fast convergence of 
Blackwell’s approachability procedure for repeated games with vector payoffs and found that, if a target 
set i? is a polytope approachable by a finite number of pure actions, then the approaching player has 
a strategy r such that for any strategy cr of the opponent and any N the expected Euclidean distance 
between the average payoff qn after N rounds and B is bounded from above by ^ with some constant 
C. 

Let us apply this result to show boundedness of Vn in almost-fair piecewise games using the standard 
link between behavior of Player 2 and the approachability (see Mertens, Sorin, and Zamir (2015) [26], 
Section V.2.C.). It is enough to show that for any p G A(iy) in Mstage game the uninformed player has a 
strategy r such that for any strategy a of Player 1 the expected distance between the normalized vector 
payoff pn = J2n=i ^ target set B = {v G | u/ < 0 VZ £ K} is bounded 

by Let us call a game pure-piecewise if piecewise condition is fulfilled in pure actions (i.e., y* takes 
values in J, not in A(J)). For pure-piecewise games B is approachable in pure actions, and the result of 
Mannor and Perchet applies and provides boundedness of Vn- 

It remains to check that any piecewise game can be reduced to a pure-piecewise. Indeed, define a new 
set of actions of Player 2 by J = JUy*{A{K)) (we add Q actions) and the stage game by = Afj for 
i G I,j G J and A^ y = ^j^jyjA^j for y G y*{A{K)). The new repeated game is pure-piecewise because 
playing mixed action y = y* {p) becomes equivalent to selecting pure action y G J. The values of both 
games coincide since addition of a convex combination of rows as a new row to a matrix game does not 
change its value. 
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The process of posterior distributions is a martingale of length + 1 with values 

in A{K) and with non-random= p (see Mertens, Sorin, and Zamir (2015) [26], Section 

V.2.a.). 

Let Mp denote the set of all A(iL)-valued martingales of inhnite length with = p. 
Formally, elements of Mp are pairs consisting of a probability space and a martingale 
dehned on this space. 

From the early works of Zamir and Mertens the recurrent equation for the sequence of 
values is the central tool to study asymptotic behavior of the value. Gensbittel (2015) [18] 
represented the recurrent equation as a martingale optimization problem where Player 1 
decides how much information to reveal at the hrst stage by selecting the distribution of 
the uninformed player’s beliefs at the second stage. 

Theorem 3.1 (Gensbittel (2015), Proposition 3.5). For a game P n with finite /, J, and 
K the following recurrent relation holds for any iV > 0 (by convention, Vq = 0) 

(3.1) Gjv+i(p) = T[Vn]{p) = sup [Vi(P) + EV)v(p(2))] , 

where V E A(A(JF)) is the distribution of p^‘^\ and VN'fP) is the value of the auxiliary 
N'-stage game with partial information on the side of Player 1 defined below. 

The game Gn'{P) is a version of Pat/ where Player 1 is not fully informed of k but 
receives a noisy signal such that his believes about k are P-distributed. It can be viewed 
as a usual game with incomplete information with A(iF) as a set of states, P as a prior 
distribution, and as a stage payoff function. By the min-max theorem 

Vi has the following representation 

(3.2) Vi(P) = min E 

where p^‘^'> ~ P, i.e., is P-distributed. 

3.2. Non-existence of small profitable revelations. The introductory section con¬ 
tains informal reasoning explaining that the \/]V-behavior of the value is related with the 
opportunity for Player 1 to reveal his information by means of small benehcial portions. 
The purpose of this subsection is to formalize the opposite situation. 

We say that in an almost-fair game Pjv Player 1 has no small profitable revelations if 
A(iF) can be represented as a hnite union of closed convex subsets A^ C A(iF), g = 1, ..Q', 
such that for any q and any P supported on Ag the value Vi(P) is zero. 

Informally, this dehnition says that, if the prior distribution p is inside Aq, then Player 1 
can not beneht at hrst stage of A-stage game without changing the belief of Player 2 
signihcantly enough by pushing outside of Aq (see (3.1)), i.e., without revealing enough 
information by his action p. 

The following proposition supports the intuition that non-existence of small prohtable 
revelations leads to bounded values. 


max 

i&I 




.i&J 
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Proposition 3.1. If in an almost fair-game Tjsf with finite /, J, and K Player 1 has no 
small profitable revelations, then for any N > 1 and any p G A(ii') 

Vn{p) < IIAIliipQ'. 

Theorem 2.1 is a corollary of Proposition 3.1 and the next lemma. Snbsections 3.3, 3.4, 
and 3.5 develop the techniqne to prove Proposition 3.1 and thereby to complete the proof 
of Theorem 2.1. 

Lemma 3.1. If a game P n fulfills the assumptions of Theorem 2.1, then Player 1 has no 
small profitable revelations with Q' = Q. 

Proof. Denote by C A(J) the set of all values that y* takes. Dehne Ag as the 

subset of all p G A(iP) such that y'^ is an optimal strategy of Player 2 in P5^^(p). Then 
A{K) = U^i Ag. The subset Ag is a closed convex polytope since this subset is cut from 

A{K) by the family of linear inequalities YhkPk AO, i G /. It remains to 

check that for any q and any V G A(Ag) we have ViifP) = 0. Indeed, playing y'^ in QiifP) 
Player 2 defends 0. This follows from (3.2) because max* is the value of 

the non-revealing game for G A^ and equals zero by almost-fairness. □ 


3.3. Monotonicity properties and the role of invariant functions. The Shapley 
operator T dehned by formula (3.1) has the following properties: 

(1) Representation of the value: Vn = T^[0] (recall that Vq = 0); 

(2) Monotonicity: if / > p, then T[f] > T[g]. 

(3) Increasing property (for almost-fair games): T[f] > f. 

Here f > g means f{p) > g{p) for all p G A{K). The hrst two items immediately follow 
from (3.1). To prove the third item we take in (3.1) a constant martingale = p. For 
such martingale V = 6p, where 6p is the Dirac (5-measure at p. The game Gi{Sp) can 
be identihed with the non-revealing game Pi(p) (see Gensbittel (2015) [18]). Hence by 
almost-fairness Vi{6p) = 0 that implies the third item. 

Using the following observation, De Meyer and Marino proved that the discrete market 
model has bounded values. 

Lemma 3.2 (De Meyer, Marino (2005) [8], Lemma 4.3). IfT has the properties (1) and 
(2) mentioned above, h>0, and T[h] = h, then Vn <h for any N >1. 

Indeed, Vn = T^[0] < T^[h] = h as 0 < h. 

In order to prove Proposition 3.1 we need to construct such h for any almost-fair game 
where Player 1 has no small prohtable revelations. We use the Kantorovich metric for 
that. 
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3.4. The Kantorovich metric. Let {Xq, do) be a compact metric space. The metric do 
induces the Kantorovich^ metric di on Xi = A(X) by 


di(p',/) = 


inf 

x'' 


E[do{x',x ")], 


where infimum is taken over all joint distributions of x' and x" with marginals p' and p", 
respectively. This makes Xi a compact metric space, and the definition can be iterated 
to define X 2 = A(Xi) and, more generally, X^ for each n. 

The dual way to define di is by the Kantorovich-Rubinstein formula 


di{p',p'')= sup [ f{x){dp'{x)-dp''{x)), 

where supremum is over all real-valued functions / that are 1-Lipshitz with respect to 
do (see Villani (2008) [33], 5.16) Hence for a Lipshitz function g on Xq the integral 
/xo dp{x) is Lipschitz with respect to di as a function of p with the same constant. 
Let us come back to games. Denote by do the discrete metric on K defined by 

dnik' k") = I ^ 

It induces the Kantorovich metric di that in this case coincides with the total-variation 
distance di{p',p") = maxscK \p'{B) —p''{B)\. In turn, di induces the Kantorovich metric 
d 2 on A(A(iL)). 


Lemma 3.3. For any game T n with finite I, J, and K 
(3.3) |Vi(P') - Vi(P")| < ||A||npd2(P',n, 

where ||A||iip is from Theorem 2.1. 

This result with a constant 2maxjj^fc instead of ||A||iip is proved by Gensbit- 
tel (2015) [18], Proposition 2.1. For Zamir’s game (Example 1.1) we have 2 maxjj^fc \^ij\ ~ 
6 and ||H||iip = 1. 

Proof. The one-stage payoff is Lipschitz in k with respect to do with a constant 
Hence HG = Jj^A^ jdp{k) is ||H||iip-Lipschitz in p with respect to di. Also for 
any y G A(J) we get max^g/ ||A||iip-Lipschitz in p. Together with (3.2) this 

implies (3.3). □ 

Remark 3.1. The above reasoning extends to the case of uncountable compact metric 
space {K,d) and || A||iip-Lipschitz payoffs by putting d instead of do in the proof above. 
Estimate (3.3) can be also generalized to the case of Vat with arbitrary N. Indeed, 
J2n=i^^n,jn A|| A||iip-Lipschitz function of p for any fixed sequence of actions, and 
taking expectation or minimum/maximum with respect to side variables does not change 
the Lipschitz constant. 


^Sometimes this metric is called the Wasserstein distance (named after Leonid Vaserstein). Note that 
Kantorovich introduced this metric to study optimal transportation problems 27 years before Vaserstein 
used it in dynamical system context (see the discussion by Vershik (2013) [32]. 
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3.5. Construction of the invariant function and the end of proof. The construc¬ 
tion of the invariant function h of T is based on the next lemma. 


Lemma 3.4. Let T^r be an almost-fair game with finite /, J, K and without small profitable 
revelations (i.e., it satisfies the assumptions of Proposition 3.1). Then for all q = I, ..Q' 
andV e A(A(K)) 

(3.4) Vi(P) < \\A\\n^Ep^pdfip,Ag), 

where the distance from a point to a set is defined in the usual way as dfip, Ag) = 
infp.eA, di{p,pq). 

Proof. Since for any V' supported on we have Vi(P') = 0, the Lipshitz property (3.3) 
implies 

Vi(P)<||A||Hpd2(P,P')- 

Let R : A{K) —> A{K) be a continuous selection of argmiUp/gA, [di{p,p')]. Picking V' 
equal to the push-forward of "P by i? leads to d 2 {V,V') = Ep^^-pdfip, Ag) and completes 
the proof. The intuition behind such a choice of V' is that we want to transport the 
“portion of P” at each p to the closest p' G A^, and one can show that this choice is the 
optimal one. □ 


Lemma 3.5. Under the assumptions of Proposition 3.1 

Q' 

Kp) = Plllip 5^(1 - dl(p, Ag)) 

q=l 

defines a non-negative invariant function ofT. 


Proof. The total-variation metric di is bounded by 1, and, therefore, h{p) is non-negative. 
Consider T[h]{p) for some p from A^u, ta = 1, ..Q'. Fix a martingale p^'^-T g Mp. From 
Lemma 3.4 and the dehnition of h we get 


Vi(P)+Eh(p(2)) < IIAIIupE 


Q' 

A„) + ^(1 - A,)) 

5-1 


(3.5) 


= IDIliipE 


q^w 


Note that the total-variation distance to a convex set is convex. Since Ep*^^) 
inequality applied to (3.5) implies 


Vi(P) + E/i(p(2))<p||j.^ 


Q' \ 

l + ^(l-di(p,Ag)) j 

q^w / 


Since di(p, A^) = 0 we can return 1 = 1 — di(p, A^,) into the sum and get 


Q' 

Vi(P) + Eh(p(2)) < ||A||iip ^(1 - cii(p, Ag)) = h{p). 

q=l 


p, Jensen’s 
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Taking maximum over we obtain T[h]{p) < h{p), and so T[h] < h. But the 

increasing property of T says that T[h] > h. Thus T[h] = h. □ 

Now Proposition 3.1 becomes a combination of Lemmas 3.2 and 3.5. This also completes 
the proof of Theorem 2.1. 

Remark 3.2. Let us check that the statement of Theorem 2.1 remains valid for games 
with a compact metric space iP, hnite /, J, and continuous A. Along the lines of the 
proof of inequality (2.1) we show existence of the value Vn for such a game. 

Fix (5 > 0. By uniform continuity of A we hnd e-net of K such that for any k E K 
and any i,j we have 

(3.6) 

where G JFe denotes a point nearest to k. Enumerate points in and make k^ single¬ 
valued by selecting the point with the smaller number in case of a tie. Let G be 

the distribution of k^{k) if k is distributed according to p. Inequality (3.6) implies that 
for any strategy a of Player 1 the guaranteed payoffs in PAr(p) and in PAr(p^) differ at 
most by N5. The same holds for any strategy r of Player 2. The game PAr(p^) can be 
considered as a game with hnite set of states hence has a value V^ip^) that does not 
exceed Q|| A||iip by Theorem 2.1. For lower and upper values of FAr(p) we have 

Ew(/) -N5< VM < ^n(p) < fAr(p") + N5< Q||A||hp + N5. 

Since 5 is arbitrary, the value exists and the inequality (2.1) holds. 


4. Small profitable revelations, simple random walks, parametric 
FAMILIES OF MATRIX GAMES, AND PROOF OF THEOREM 2.2 


The main tool in the proof is a representation for Vn as the value of a martingale- 
optimization problem from Proposition 3.5 of Gensbittel (2015) [18] (dehnitions of Mp 
and Vi are introduced in Subsection 3.1 where we already formulated another version of 
his result): 


(4.1) 


^v(p) = niax E 

p(n>l) 


N 




n=l 


where Vn is the conditional distribution of pG+i) given p^^\p^^\ ..pG)_ 


Remark 4.1. Starting from an arbitrary martingale pG>i) one can explicitly construct a 


EtlMVn) 


in P 7 v(p), see Gensbittel (2015) [18]. 
[2010) [9] where it was formulated in 


strategy of Player 1 that guarantees E 

This idea together with (4.1) comes from De Meyer 
the context of market models. 

In particular, a strategy that gives to Player 1 an expected gain of the order of \/N 
can be extracted from our proof of Theorem 2.2. 


We begin with formalization of what does it mean that Player 1 can beneht by revealing 
an arbitrary small amount of information. Intuition mentioned in the introduction for 
the continuous market model connects this property with \/]V-behavior. So does Propo¬ 
sition 4.1 below. It is proved in Subsection 4.2 using formula 4.1 and explicit construction 
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of a martingale that provides \/]V-lower bound on Vn{p)- In Subsection 4.3 we check that 
the assumptions of Theorem 2.2 imply existence of small prohtable revelations. 


4.1. Small profitable revelations imply \/]V-behavior. In an almost fair game Player 1 
is said to have small profitable revelations if for some p\p'' G A{K), p' ^ p", there exists 
Ca > 0 such that for any Oi, 02 G [ 0 , 1 ] 


Vi 


1 1 


> Ca|<ti ~ CI2I, 


where p{a) = ap' + (1 — a)p'', and 5p G A(A(iP)) is the Dirac ^-measure at p. 

In other words, Player 1 has small prohtable revelations if at hrst stage of A-stage game 
by an arbitrary small change of the uninformed player’s belief (from p^^l = p to 

p( 2 ) = p{ai) or to p^^'> = p{a 2 ) equally likely) he can get a proht of the order of this 
change. 


Proposition 4.1. If in an almost-fair game P^r with finite I,J, and K Player 1 has 
small profitable revelations, then Vv(p) is of the order of y/N for any p G 

This proposition is proved in the next subsection. Together with the next proposition 
it implies Theorem 2.2. 


Proposition 4.2. Under the assumptions of Theorem 2.2 Player 1 has small profitable 
revelations. 


We prove Proposition 4.2 in Subsection 4.3. 


4.2. Simple random walks and proof of Proposition 4.1. In order to prove Propo¬ 
sition 4.1 we dehne a martingale from that ensures \/]V-lower-bound in (4.1) 


by 



^■(nATjv) \ 

Ay/N J ’ 


where is the simple random walk over Z starting from 0 at time n = 1, stopping 

time Tjv is the minimal n such that > 2y/N — 1, and A denotes taking minimum. 

As above, p{a) = ap' -f (1 — a)p". 

The following lemma says that makes enough jumps before time A -|- 1. 


Lemma 4.1. P({rAr > A}) > ^ for any A > 1. 


Proof. Let us estimate P({rAr > A}) = 1 — P ^{max„=i_,,Ar >2\/A —l}j from 

below. 

For any square-integrable martingale A 7 ^ 0, and A > 1 the maximal Doob 

inequality holds (see Revyz, Yor (1999) [28], Corollary (1.6) from Chapter II); 


P 


max 

n=l,..N 


>x}]< 


E 




A 2 
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The simple random walk is a martingale, and hence by Doob’s inequality 


P 


max 

n=l,..Af 


> 2\/iV- 1 


< 


N -1 


< 


□ 


(2\/]V- 1)2 ' 2 

Thus P({rAr > N}) > I that completes the proof of the lemma. 

Proof of Proposition f.l. First, let us prove the result for p = p (|). Using the constructed 
martingale pU>i) in formula (4.1) we get 


Uv p 


> E 


N 




n=l 


N 

> ^ E [Vi(Pn) \tn > n]F {{tn > n }). 

n=l 


If the martingale has a jump at time n, i.e., if > n, then the conditional distribution 
Vn of given p’^^^ ..p’^"'^ with p^”^ = p(a) is equal to __Since 


4\/jV 


Player 1 has small prohtable revelations E [Vi(Pn) \ > n] > 


. Therefore 


N 


W I p I U) > > U) > ^^/jVP({r^, > N}) > 


(in the last inequality we applied Lemma 4.1). This gives the result for p = p 

Now consider an arbitrary prior distribution p from the relative interior A''®'“*(7F) of 
the simplex. We need two observations: 

• For almost-fair games 4^(5*,) = 0 for any k. Indeed, for such prior both players 
know that the game is N times repeated matrix game that can be identihed 
with F^^((jfc) and so has zero value. 

• Vat is a concave function of p (see Mertens, Sorin, Zamir (2015) [26], Section V.I., 
or Gensbittel (2015) [18]). 

Any p G A’^®*“*(iF) can be represented as a convex combination p = /3p (4) + J2k&K 
with (3 > 0. The maximal possible (3 equals 1 — di {p,p{\)), where di denotes the 
total-variation distance (di is dehned in Subsection 3.4). Concavity leads to 

Uv(p) > ^Vn (^P Q^^+^/3fc'14r(4) = ^Vn (^P > (^l-di (^p,p 

Thus Proposition 4.1 is proved. □ 


4.3. Parametric families of matrix games and proof of Proposition 4.2. The 

proof of Proposition 4.2 is based on two lemmas. The hrst one says that we can hud 
a subinterval where the unique optimal strategy y* = p*(a) G A(J) of Player 2 in the 
non-revealing game Pf^(p(a)) depends on a strongly enough. Let us formulate this result 
more rigorously. 

Lemma 4.2. Under the assumptions of Theorem 2.2 there exists a constant C > 0 and 
0 A •Tmin ^ •Tmax A 1 SUch that for any Oi, 02 £ [•Tmin; Q^max] 

(4.2) \\y*{ai) - y*{a 2 )\\ > C\ai - 02 !, 

where || • || denotes the Euclidean norm. 
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Proof. Consider an arbitrary matrix game A. Let y* be an extreme point of the set of 
Player’s 2 optimal strategies. Denote by the vector of its non-zero components. The 
classical result of Snow and Shapley (see Karlin (1959) [21], Theorem 2.4.3) says that there 
exists a square submatrix M oi A such that y*^^ = ^e^adj(M)e) non-zero denominator. 
Here e denotes the vector of all ones, (•, •) is the standard scalar product, and adj(M) 
is the adjugate of M, i.e., adj(M)ij equals the (j,7)-cofactor of M. In particular y* is a 
rational function of entries of M. 

The result of Snow and Shapley implies that, if a matrix game A = A{a) is an affine 
function of a parameter a and the optimal strategy y* = y*{a) of Player 2 is unique for all 
a, then y* is a piecewise-rational function of a with hnitely many domains of rationality 
(since the number of square submatrices of A is hnite). 

In Theorem 2.2 it is assumed that y*{(y) takes inhnitely many different values. Therefore 
there is an interval D C [0,1] such that y*{(y) is a non-constant rational function on D. 
Hence there is a point in the interior of D where -^y* 7^ 0. In a small annulus [amin, <Tmax] 
of this point y* is close to affine function with non-zero slope. This implies (4.2). □ 

The next lemma says that losses of Player 2 from playing non-optimal mixed strategy 
in the non-revealing game are proportional to the distance between the strategy used and 
the optimal one. This result is similar to Lemma 4.3 from De Meyer (1996A) [2]. We 
borrow the idea of the proof from his paper. 

Lemma 4.3. Under the conditions of Lemma f.2 there exist an interval [«min) ®max] 
[ttmin, IT max] of non-zcro length and a constant C' > 0 such that for any y G A( J) and any 

(y Iry^ . 0 / 1 

^ ^ L mm’ ^maxJ 

(4.3) ms^^A^^fyj > C'\\y*{a) - y\\. 

Proof. Let Io{a) be the set of alH G / such that A^^^'^y*{a) = 0. Find a subin¬ 

terval [aCn, iT(nax] [iTmin, Tmax] such that Io{a) aud the support J*{a) of y*{a) remain 
the same when a ranges over the subinterval. Such subinterval exists since y*{a) is a 
continuous function of a (indeed, if a as m —)■ 00 and y = lim^^oo I/* (ctm), then y 

is an optimal strategy in A^A^ and hence y = y*{a) by uniqueness). The left-hand side 
of (4.3) is bounded from below by f{a,e) = maxjg/g(a) where e = y — y*{a). 

Dehne D{a) = {e: G | e = (?/ — y*{a))/\\y — i/*(a)||, y G A(iP) \ {i/*(a)}} and note 
that D{a) is a compact set and does not depend on a for a G [<Tmin) Q^max]- ^ func¬ 
tion / is a homogeneous function of e for any a. Therefore it is enough to show that 
C = minjc,/. f{ct,^) > 0. Since / is continuous, this minimum exists an is 

attained at some {a*,e*). If C <0, then for A > 0 small enough we would have 
maXjgT -'^yj < 0 for y = y*{a*) -|- Xe* that contradicts the uniqueness of the 
optimal strategy of Player 2 in P5^^(p(a*)). This contradiction completes the proof. □ 

Now Proposition 4.2 can be proved easily. 

Proof of Proposition f.2. Consider the game QiifP) with V = + |<5p(a2)- This game 

can be interpreted as follows. Chance selects one of two matrix games or A^A^l 
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equally likely. Then the game selected is played, but only Player 1 knows the choice. We 
assume that ai,a2 G [®min)®max]) where and come from Lemma 4.3. Suppose 
Player 2 uses a strategy y G A(J). Then Lemmas 4.2 and 4.3 prevents him from being 
successful in both and More formally, by playing y he defends (see (3.2)) 


lEp(2)..^-p 


max 

i&I 




j&J 


1 

- max 
2 iei 




1 

H— max 
2 i&i 




P(«2) 




Lemma 4.3 implies that this amount is bounded from below by 

Ic'llrUi) - !/|l + \c'\\y'(a^) - »|| > \c'\\y-‘{a,) - !/-(a 2 )||. 

Here we applied the triangle inequality. By Lemma 4.2 this quantity is greater than 
CC'\ai — Q!2|/2. Since y was arbitrary we get 


Vi 


1 1 


> —CC^Ui ~ U2I 


This inequality is equivalent to existence of small prohtable revelations with redehned p' = 
p(aTn), p” = p(Umax)! ^^^1 Ca = n, , , —T and concludes the proof of Proposition 4.2. 

^l^max ^min/ 

□ 


5. What is next? 

The results of this paper raise more questions than answers: 

• The piecewise property can be easily checked for any particular repeated game 
because it appeals only to a parametric family of matrix games. But can one 
describe piecewise games more explicitly? How does one construct examples of 
such games? The only way we know is to start from discrete market game and to 
modify it somehow. 

• Here we obtained that the sequence of values Vai of Wstage almost-fair piece- 
wise game converges to some finite limit Hoc as —?> oo. This suggests one 
to consider an inhnite-stage version Pqo of such game. Do players have optimal 
strategies in Poo? For discrete market games the positive answer with explicit 
construction is given by Domansky (2007) [11]. In particular, he obtained that 
in the inhnite market game Player 1 (behaving optimally) will reveal his private 
information in a hnite time. Do we have such a counterintuitive effect for all 
almost-fair piecewise games? What are the properties of Hoo? For market games 
it is a piecewise-linear function of p, see the paper of Domansky and the one 
of De Meyer and Marino (2005) [8]. What is the speed of convergence? San- 
domirskaia (2016) [29] showed that the speed is exponential for the discrete market 
model. 

• The role of uniqueness assumption in Theorem 2.2 should be clarified. For exam¬ 
ple, are there any almost-fair games with Viv growing to inhnity but slower than 
vbV? Does there exist a non-piecewise almost-fair game with bounded values? 
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• Existence of the limit VN/y/N as iV —)■ cx) for games with Vn of the order of 
y/N is an important open question. For the widest class of games considered in 
the literature, the result is conditional: existence is proved under the assump¬ 
tion that a certain partial-differential equation has sufficiently regular solution 
(De Meyer (1996A) [2]). 

• To hnd a proper generalization of Theorem 2.1 to inhnite I and J and of Theo¬ 
rem 2.2 to inhnite I, J, K seems to be interesting. So are generalizations of the 
theorems to games that are not almost-fair or have incomplete information on 
both sides. 

We plan to discuss some of the questions raised in subsequent publications. 
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